DeepSeek R1, a breakthrough AI model using reinforcement learning, achieves human-level reasoning and matches OpenAI's o1-1217 performance